Bayesian Inference under Cluster Sampling with Probability Proportional to Size
نویسندگان
چکیده
Cluster sampling is common in survey practice, and the corresponding inference has been predominantly design-based. We develop a Bayesian framework for cluster sampling and account for the design effect in the outcome modeling. We consider a two-stage cluster sampling design where the clusters are first selected with probability proportional to cluster size, and then units are randomly sampled inside selected clusters. Challenges arise when the sizes of nonsampled cluster are unknown. We propose nonparametric and parametric Bayesian approaches for predicting the unknown cluster sizes, with this inference performed simultaneously with the model for survey outcome. Simulation studies show that the integrated Bayesian approach outperforms classical methods with efficiency gains. We use Stan for computing and apply the proposal to the Fragile Families and Child Wellbeing study as an illustration of complex survey inference in health surveys.
منابع مشابه
Bayesian inference for the finite population total from a heteroscedastic probability proportional to size sample
We study Bayesian inference for the population total in probability-proportional-to-size (PPS) sampling. The sizes of non-sampled units are not required for the usual Horvitz-Thompson or Hajek estimates, and this information is rarely included in public use data files. Zheng and Little (2003) showed that including the non-sampled sizes as predictors in a spline model can result in improved poin...
متن کاملInference for the Proportional Hazards Family under Progressive Type-II Censoring
In this paper, the well-known proportional hazards model which includes several well-known lifetime distributions such as exponential,Pareto, Lomax, Burr type XII, and so on is considered. With both Bayesian and non-Bayesian approaches , we consider the estimation of parameters of interest based on progressively Type-II right censored samples. The Bayes estimates are obtained based on symmetric...
متن کاملCost Analysis of Acceptance Sampling Models Using Dynamic Programming and Bayesian Inference Considering Inspection Errors
Acceptance Sampling models have been widely applied in companies for the inspection and testing the raw material as well as the final products. A number of lots of the items are produced in a day in the industries so it may be impossible to inspect/test each item in a lot. The acceptance sampling models only provide the guarantee for the producer and consumer that the items in the lots are acco...
متن کاملLatent Dirichlet Bayesian Co-Clustering
Co-clustering has emerged as an important technique for mining contingency data matrices. However, almost all existing coclustering algorithms are hard partitioning, assigning each row and column of the data matrix to one cluster. Recently a Bayesian co-clustering approach has been proposed which allows a probability distribution membership in row and column clusters. The approach uses variatio...
متن کاملDynamic importance sampling in Bayesian networks using factorisation of probability trees
Factorisation of probability trees is a useful tool for inference in Bayesian networks. Probabilistic potentials some of whose parts are proportional can be decomposed as a product of smaller trees. Some algorithms, like lazy propagation, can take advantage of this fact. Also, the factorisation can be used as a tool for approximating inference, if the decomposition is carried out even if the pr...
متن کامل